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Abstract— Recent studies have shown that joint coding is more 
efhcient and effective than independent coding for compression 
of multiple video programs |3]-[7]. Unlike independent coding, 
joint coding is able to dynamically distribute the channel capacity 
among video programs according to their respective complexities 
and hence achieve a more uniform picture quality. This paper 
examines the bit-allocation issues for joint coding of multiple 
video programs and provides a bit-allocation strategy that results 
in a uniform picture quality among programs as well as within 
a program. To prevent the encoder/decoder buffers from over- 
flowing and underflowing, further constraints on bit allocation 
are also discussed. 

Index Terms — Constraints for buffer protection, dynamic bit 
allocation, joint coding, multiprogram transmission, rate control, 
statistical multiplexing. 

I. Introduction 

WITH recent advances in digital video compression, such 
as MPEG-2 [I], and in digital transmission technology, 
conventional television channels are now capable of delivering 
several digitally compressed video programs. For example, by 
using MPEG-2 video coding, digital NTSC video programs 
can be compressed to 4-8 Mbits/s while providing an adequate 
picture quality. Digital transmission technology can support a 
payload of up to 19 Mbits/s in a 6-MHz terrestrial broadcast 
TV channel or 27—43 Mbits/s on cable TV channels. Hence, 
it is possible to transmit several digital television programs 
in the same bandwidth presently occupied by a single analog 
TV channel. 

Fig. 1 shows a block diagram of a multiprogram transmis- 
sion system in which several video programs are compressed, 
multiplexed, and transmitted over a single channel. Clearly, 
these programs have to share the channel capacity. In other 
words, the aggregate bit rate of the programs has to be equal 
to (or less than) the channel rale. This can be achieved by con- 
trolling either each individual program bit rate (independent 
coding) or the aggregate bit rale (joint coding). 

In independent coding, programs are coded independently. 
Each encoder, therefore, needs a separate rate control to 
maintain its bit stream at a constant bit rate (CBR), as shown in 
Fig. I. Rate control for independent coding can be performed 
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across the time and spatial dimensions of a program. There are 
two main shortcomings with independent coding at a constant 
bit rate. First, independent coding at a fixed rate may lead to 
large variations in picture quality among programs. Procram 
complexity typically varies significantly between programs. 
For example, news programs usually have a relatively static 
background and very little motion, while sport and music video 
programs may contain fast motions and rapid scene chances. 
As an example. Fig. 2 shows the peak signal-to-noise ratio 
(PSNR) for three video programs (Avers, Martin, and Hook) 
coded independently using MPEG-2 at a bit rate of 3 Mbits/s. 
It is clearly demonstrated that the picture quality level among 
these three programs is quite different. The second problem 
with independent coding at a constant rate is an inefficient 
use of the channel capacity. Not all scenes within a program 
need to be encoded at the same bit rate to achieve a similar 
picture quality. For example, relatively static scenes require 
much less bits than complex scenes with fast motion. If the 
bit rate for a given program is set sufficiently high to maintain 
a good picture quality for the most demanding segment, that bit 
rate is effectively wasted during the less challenging periods. 
Furthermore, if a fixed bit rate is used, different scenes will 
be rendered with unequal quality, as shown in Fig. 2. where 
the PSNR for each program also varies with time. 

Unlike independent coding, joint coding only needs to 
maintain the aggregate bit-rate constant, while allowing the 
bit rate of each individual program to vary, as shown in 
Fig. 3. Rate control for joint coding is therefore extended 
to an additional dimension, that is, the program dimension, 
which implies more freedom in allocating the channel capacity 
among programs and hence more control over the picture 
quality among programs, as well as within a program. For 
example, joint coding can move bits from the less active 
programs to the more active ones in order to maintain a 
uniform picture quality. Recent studies have shown that joint 
coding is more efficient and effective than independe nt eodin« 
for multiprogram video coding [3]-[7J. In fact, the study in 
[5] shows that in joint coding, as the number of programs 
increases, the compressed bn streams exhibit characteristics 
similar to those of variable bit-rate compression, which is 
known to be more efficient than CBR compression [3], [7]. 

This paper examines the bit-allocation issue for joint cod- 
ing of multiple video programs. In Section II, we present a 
bit-allocation strategy for uniform picture quality. The bit- 
allocation strategy is of a hierarchical nature. At the very top 
level, we introduce the concept of a super group of pictures 
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Fig. 1. Independent coding: programs are coded separately, each with a separate rate control. 
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(GOP) in Section II- A. The input video programs are divided 
into super GOP's (hat are identical in terms of the number 
of frames of each type (I, P. and B). The super GOP's are 
all assigned the same number of bits. In Section II-B, we 
define a super frame as a collection of frames, one from 
each of the programs taken at the same time instant. Given a 
number of bits for a super GOP, we formulate the bit-allocation 
strategy for each super frame within the super GOP. To prevent 
the encoder buffer from overflowing and underflowing, we 
impose constraints on the target number of bits for each super 
frame, as discussed in Section 1I-C. In Section U-D, the same 
methodology is extended to the frame level, where each regular 
frame receives a target number of bits that is proportional 
to its complexity measure. To ensure that the decoder buffer 
never overflows or underflows and to limit the bit rate of each 
individual program within a specific range, we apply additional 
constraints on the target number of bits for each regular frame. 
In Section 111, we report the simulation results. 



II. Dynamic Bit Allocation 

Assume that all the encoder engines in joint coding (Fig. 3) 
use MPEG-2 video coding [1 ). The MPEG-2 standard defines 
three main picture types in terms of temporal processing: 
I (intraframe coded) pictures, P (forward predictive coded) 
pictures, and B (bidirectional predictive coded) pictures. The 
input video frames are organized into GOP's; each may contain 
one I-picture, a number of P-pictures, and optionally some B- 
pictures between I- or P-pictures. Fig. 4 shows an example 
where the input video sequence is divided into GOP's of 
15 frames: one I-picture. four P-pictures. and two B-pictures 
between I- or P-pictures. 

A. Super GOP and Target Rate 

Assume that L video programs are to be delivered over 
a network with a fixed channel rate /trhannei. as shown in 
Fig. 3. The L programs can have any GOP structure, thai is. 
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Fig. 3. Joint coding: programs aie coded jointly; a joint rate control is implemented. 
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the program GOP length as well as the distance between I- or 
P-pictures can be different from program to program. 

We stan by conceptually dividing the L programs into 
identical groups, so-called super GOP(L : /V), as shown in 
Fig. 5, in terms of the number of frames of each picture type. 
Mere, L is the number of programs and N is the length of super 
GOP's. Since these super GOP's contain the same number 
of frames of each picture type, they are assigned the same 
number of bits. Clearly, each super GOPfL. A r ) contains LxN 
frames. As a matter of fact, the value of ;V is not unique, but 
from an implementation point of view, small super GOP's 

are preferable. Let >\' h I = I.2 L be the GOP length 

lor program /. We set N equal to the least common multiple 
(LCM) of :Y,. / = I. 2. .... i.e.. 



N = LCM(N l , X 2 .... t N L ) 



(!) 



where ;V is the smallest number that can be divided by 
all A'/. / = 1, 2. .... L. Hence, the super GOP'sJ A r ) are 
the smallest identical groups containing the same number of 
frames of each picture type. For example, if there are two 
different program GOP lengths for these L programs, say 9 
and 15, we will have a super GOP length of 45 frames. 

A super GOP always contains an integer number of GOP's 
of a program. The GOP's of individual programs, however, do 
not have to be. aligned with each other or with the super GOP. 
A super GOP boundary may cut through a program GOP, or 
a super GOP may include fractional portions of the GOP's 
of a program at the boundaries. The fractional portions of 
GOP's of a program included inside a super GOP. at the super 
GOP's left and right boundaries, can however be considered as 
a "complete" GOP. Fig. 6 shows an example of a super GOP 
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Fiz 6. A super GOP may include fractional portions of GOP's of a program 
at "boundaries. The fractional portions of GOP's of a program included inside 
a super GOP. at super GOP boundaries, can be considered as a "complete" 
GOP A super GOP contains an integer number of GOP's of each program. 



with nonaligned program GOP's. As seen, the boundaries of 
the GOP's of program 2 do not coincide with the boundaries 
of the super GOP. However, the two fractional parts of the 
GOP's included inside the super GOP, at the super GOP's left 
and right boundaries, actually contain the same numbers of 
P-. and B-pictures as a complete GOP. Furthermore, if all 
the programs have the same GOP length, say, A 7 , the super 
GOP length will also be equal to A 7 , regardless of whether the 
program GOP s have the same patterns of I-, P-, and B-pictures 
or whether they are synchronized or not. In any case, super 
GOP's contain the same number of frames of each picture 
type from one of programs. 

Since all super GOP's(L, A r ), with N defined by using (1) 
contain the same number of I-, P-, and B-pictures, they can 
be assigned the same number of bits T, i.e., 



T - A'^ r hannel(bpf) 



where ^ C hannd(b P f) = ch(nmeLrate/ Jrame.rate is the 
average number of bits per frame. 



B. Super Frames and Targe! Rate 

Given the target number of bits 7\ for a super GOP(L, A 7 ), 
the next step is to determine the distribution of T over the 
frames within a super GOP. We further define a super frame 
as a collection of L frames, one from each of the L programs 
taken at the same time instant, as shown in Fig. 7. Clearly, 
a super GOP contains A r super frames. Note that these L 
programs can have any GOP structure, and the L frames in a 
super frame may be of any picture type. In the following, we 
will determine a target number of bits for a super frame and 
then a target number of bits for a regular frame according to 
the program complexities. 

As in MPEG-2 Test Model 5 (TM5) [2], we define a 
complexity measure C for a frame as the product of the 
quantization Q used for the frame and the number of bits 
R generated for the frame using Q, i.e., 



Lei Q/,n.f and R LnJ be. respectively, the quantization param- 
eter used for frame v. of program / and the corresponding 
number of bits generated for the frame using Qi. u where 1 
indicates the picture type 1. P. and B. For a super frame n. 
there can be L different frame complexity measures, one for 
each reeular frame, i.e.. 



(4) 



Let T n be the target number of hits for super frame n. 
The total number of bits generated from the L regular frames 
within super frame ?i should be equal (or close) to T„. i.e.. 



T „ = <r -E ^ 



1=1 



(5l 



Similarly, the total number of bits generated for all the super 
frames in a super GOP should be equal (or close) to the target 
number of bits T assigned for each super GOP. i.e., 



(6) 



Since one of the objectives of joint coding is to achieve a 
uniform picture quality, ideally, the same quantization param- 
eter should be applied to all the frames (note that quantization 
is the only lossy operation in MPEG-2 encoding and plays a 
critical role in controlling both the picture quality and the bit 
rate). However, in order to take into consideration the different 
picture types (I, P, and B). as in TM5, we introduce a constant 
weighting factor A', n>f for each picture type /, i.e.. 



(2) where 



A', 

A'/.n.t = { A> 

A B 



for I picture 
for P picture 
for B picture. 



(7) 



(Ri 



Note that t corresponds to the picture type, I. P. or B. in TM5. 
A'i = A> = 1 and A'b = 1-1. From (5) to (7), we can obtain 
the target number of bits for super frame 7/. T u as 



E 



G. 



T„ = 



(9) 



EE 



A'i.,,,1 



C = /£(?. 



(3) 



where the numerator is the sum of the complexity measures 
for the L regular frames in super frame n and the denominator 
is the sum of the complexity measures for all the frames in 
the current super GOP. Clearly, the target number of bits for 
super frame n.T Tl: is proportional to its complexity. 

The problem with the above bit-allocation equation is that in 
computing a target number of bits for a super frame, we need to 
have the complexity measures for all the L x A r frames within 

the current super GOP(L.A : ). i.e.. £•/,„.,./ = 1.2 L and 

7? = 1.2, A\ which may not be very practical. We now 

make two modifications to simplify the calculation. First, at 
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each super frame 77.. we only consider the distribution of the 
remaining biis T r defined as 



(10) 



l=\ 



over the remaining super frames from n to N in the super 
GOP(L,A"). Here. »s the number of bits gen- 

erated from super frame ?i - 1. This leads to the bit allocation 
for super frame n as 



T — 

Tl .V 



E 



W.n.t 



Ct.n.t 



(ID 



The complexity measures for the previous frames from 1 to 
n - 1 are now no longer necessary in computing the target 
number of bits for super frame n,T u . At the start of processing 
a new super GOP. T r needs to be reset as follows: 



T r - T 



(12) 



where T is the target number of bits for a new super GOP 
(2) and T r on the right of the equation is actually the number 
of bits left over from the previous super GOP, which can be 
either a positive or negative number. 

Secondly, we assume that all the future frames of the same 
picture type in a program have the same complexity measure, 



i.e.. 



< N 



(13) 



which is a reasonable assumption for continuous scenes. Now, 
for each program, we only need to keep three complexity 
measures, one for each of the three picture types I, P, or B, 
i.e., G.i.Ci.p. and C'/.u- The bit-allocation strategy for super 



frame n therefore becomes 
l 



T n = 



T r (14) 



where: 

1) Oi % n tt is the complexity measure for frame n of picture 
type t 6 {I.P.B} for program /. After encoding frame 
n, the complexity measure for the corresponding picture 
type t G {LP.B} of program l.Ci.tz \,p.B) is updated. 
It can be estimated/calculated based upon the average 
quantization parameters used for the frame and the 
number of bits generated for the frame (3i. The updated 
complexity measures arc then used in calculating the 
target rates for the next frames. 

2) Ki.n.t is a constant factor used to compensate for the 
picture type /. ~ {I.P,B} of frame n of program /. It 
can be either h'\. A> or A'o- depending upon the picture 
type. 

3) iV/j. jV/ t p. and A"/b are. respectively, the remaining 
numbers of 1, P. and B pictures for program / in the 
current super GOP at super frame n. 

The numerator on the right side of (14) is the sum of 
complexity measures for all the frames in super frame It 
can be considered as a complexity measure for super frame 
n. The denominator can. on the other hand, be considered 
as a complexity measure for the entire set of the remaining 
frames in the super GOP. Hence, we can state that ( 14) actually 
assigns a super frame a target number of bits proportional to 
the super frame's complexity measure. 

C. Constrain! on Super Frame Target Rule 

For transmission over a constant bit-rate channel, the aggre- 
gate bit stream needs to be smoothed out using the encoder 
channel buffer, as shown in Fin. 3. The encoder buffer fullness 
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Fig. 8. Packets of selected program are extracted and decoded at the decoding end. 



at frame n,B e n (Fig. 3) can be calculated as [8] 



hannel(bpf). 



(15) 



Lei B^ &x ^ e maximum encoder buffer size. To ensure that 
the encoder buffer never overflows or underflows, the buffer 
fullness B c n has to be constrained within the range [0. B e m&x }, 
i.e., 



0 < Bl < Bt 



(16) 



From (15) and (16), we have 



or 



^channe)!.bpf) ~ #n-l ~Y J ^'' n ' t - ^channel(bpf ) 



+ B° - B 



(18) 



This is a constraint on the total number of bits generated for 
super frame n for a given channel rate i? C hannei(bpf)- If me 
aggregate rate can be controlled to meet the target rate, i.e., 



T » = Y ^ 



(19) 



the constraint on the total number of bits for a super frame 7i 
becomes the constraint on the target number of bits for super 
frame, i.e., 

^charmeHbpfl " #n - 1 < ^» ^ ^channe](bpf ) + ^max ~ ^n-T 

(20) 

Hence, before starting to encode each super frame n. we need 
to check if its target rate determined by (14) is in the proper 
range, and if not, we adjust it as follows: 

f S?rhatirjeKbpr; ~ 



if T u < /?char.nel(bpf) ~ &n 



■chaniioUlipf) 



(21) 



if T n > /? c hannei(bpf) + B^ &x - B € n _ x 

< J n . otherwise. . 



D. Target Rate for Regular Frames 

Once a target number of bits for a super frame n, T„, has 
been set, what remains is to distribute the bits over the regular 
frames in the super frame. By the same methodology, we can 
have the target number of bits for frame 77 of program /. 7i, n , 



1 



T it n 



(22) 



where the numerator on the right is the complexity measure for 
frame n of program / and the denominator is the complexity 
measure for super frame n. The distribution of T u assigned 
for super frame n over L regular frames in the super frame 
is again based upon the complexity measures of the program 
frames. 

E. Constraint on Target Rate for Regular Frame 

In multiprogram broadcast applications, several video pro- 
grams arc multiplexed on a single fixed-rate transmission 
channel. Service information included in the bitstream would 
provide the necessary navigation information to allow the 
viewer to select the desired programs and the settop to turn 
onto the proper channel and extract (demultiplex) the packets 
corresponding to the selected programs, as shown in Fig. 8. 
The demultiplexed bitstream is at a variable rate. To ensure 
that the decoder buffer does not overflow or underflow when 
any of the programs are selected, we need to apply, at the 
encoder, additional restrictions to each individual program's 
bit rate. 

Assume that program / is selected and that the decoding 
delay is A" frames. Let 7?f tl be the number of bits transmitted 
for program / during the nth frame period. The decoder buffer 
will therefore be filled up to 



(23) 



before any bits are moved out. During the period of frame 
71 + -V > A'', the decoder buffer moves R i n t bits to the 
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decoder and receives /?f n _ v bits from the network, as shown 
in Fig. 8. The buffer fullness at frame n+A" > A' r/ is therefore 
given as 

# = B'i_ , + - 71,.,., n = 1 , 2, . . . (24a) 

n ri 

n'=l n' = l 

A" n+A" n 

= E + E - E *»■-•.« ( 24c ) 

"'=1 n' = l + .V n ' = l 

n-r.V' /n-1 \ 

= £ 7£ n . + £ H c n , = 1 fi,^., - i? Ui/ (24d) 

n'=n \n' = l / 

n + A" 

= £ Rl n . - Bf n _ i - K linif (24c) 



where 



frame, i.e.. 

(30) 

Hence, before starting to encode each frame n of program /. 
we need to check if its target rate is within the proper range, 
and if not, we adjust it as follows: 

ii -r A'' 

E ^n.-^n-,-^, 
n'=Ti 

> {T >»< E ^,-Bf^-B^ 
n'=n 

n + .V 

E ^n<-*?,n-, ; 



- < 



flf.- = E *<.«'■< - E R u 



(25) 



n'=l 



if 7-<,,> E *r,-i 

n ' ss n 

k T/. n . otherwise. 



can be considered as the fullness of a virtual encoder buffer 
for program / at frame n. Let tf£ ax be the maximum decoder 
buffer size. To ensure that the decoder buffer never overflows 
or underflows, the buffer fullness B* has to be within the 



1.31) 



range of [O.BjLj. i.e.. 



0<Bt< 2?!,. 



(26) 



From (24) and (26). we have 

M-.V 



0< E Bh,.-Bl n _ x -/?,„., <Bf nAX (27) 



or 

n-f \" 



E ^■•■Br.i.-i-flLx < E R tn-Bi n _ v 

(28) 

This is a constraint on the number of bits generated for frame 
n of program /. Again, we assume that the bit rate for each 
program can be controlled to meet its target rate, i.e.. 



(29) 



Here, Bf n _ ] is. the fullness of a virtual encoder buffer for 
program / at frame n- 1 and is available at frame u. However. 
^,n" «' = n+1, . . - , n-A". are the number of bits that will 
be transmitted for program / during the intervals of the current 
and future frames n, n+1, . . . . u+ A' v . Fortunately, in our case, 
the channel rate /? c h*ni.ei(|.bf; is constant in bits/frame. So. we 

are able to measure these /?{"„.. n # = -« 1 n + A**'. 

in the encoder buffer, as shown in Fig. 9. Within each slot 
window n = n z n -f 1, . . , n + A*' of £ hlinnilUb|jn bits in the 
encoder buffer, the number of bits for program / is /?f* n . Note 
that R c ( ti is not necessary nonzero. 

F. Constraint on Max and Min Rate 

We can also control the average bit rate over a certain 
number of frames by limiting the target number of bits for 
each frame within a specific range. Lei and /?„..,. be the 

allowed maximum and minimum average bit rates over .V" 
frames. The average bit rate of A'" frames up to a frame n 
therefore has to be in the range of [/? mill . /? n;nv J. i.e.. 



The constraint on the bit rate for each individual frame (l.n) 
becomes the constraint on the target number of bits for the 



] 



(32) 



= n~,V 
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Leng;h 


15 


15 
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TABLE il 
Programs Included at Each Stage 


L 


Avers 


Hook 


Martin 


Flower 


Mobile 


Te> m is 




Yes 


Yes 


Yes 








4 


Yes 


Yes 


Yes 


Yes 






6 


Yes • 


Yes 


Yes 


Yes 


Ye> 


Yes 



III. Simulations and Results 

In evaluating the bit-allocation strategy, simulations were 
carried out with a large number of lest video programs. All 
the lest materials had a spatial resolution of 720 x 480 with 
a color-sampling ratio of 4 : 2 : 2. interlaced ai a frame rate of 
30 Hz. The programs were encoded using the MPEG-2 main 
profile/main level syntax [1]. Table I shows the GOP structures 
used for the test sequences in the simulations. Some sequences 
have a GOP length of 18 frames, while the others have a GOP 
length of 15 frames. There are two B-pictures between I- or 
P-pictures. 

For comparison purposes, the sequences were coded both 
independently and jointly. In independent coding, each se- 
quence was encoded separately, at a bit rate of ? Mbiis/s. 
using the TM5 rate control (2). In joint coding, the proposed 
bit-allocation strategy is used. To examine how the picture 
quality varies with respect to the number of programs, joint 
coding was performed with three different conditions — i.e.. 
L = 3.4. and 6 programs — at total bit rates of 3L Mbits/s. 
respectively. Sequences Aycrs. Hook, and Martin were first 
grouped in a three-program joint coding at a total rate of 
9 Mbits/s. The sequence Flower was then added to form a 
four-program joint coding, at a total rate of 12 Mbiis/s. Two 
more sequences. Mobile and Tennis, were then included for a 
six-program joint coding. Table II summarizes the programs 
included at each step. 

A. Simulations 

Fig. 10 shows the block diagram of joint coding simulations. 
At each frame period, each MPEG-2 encoder, say. /, receives 
a target number of bits T; from the rate-control engine. The 
target number of bus for each frame is then met by adjusting 
the quantization parameter in- each MPEG-2 encoder. The 
resulting number of compressed bits (/?•)■ and the average 
quantization parameter (C\ ; used for the current frame of a 
program are sent to the rate-control engine, and their product is 
used to update the complexity rheasure for the corresponding 
picture type. The rate-control engine then determines a new 
target number of bits for the next frame, using the proposed bit- 
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- Y, *'•"'•<• (33) 

n' = n-.V" + l 

This is an additional constraint on the number of bits for frame 
7i of program /. Note that n' = n-N" ,n-N" , . . . ,n- 

1. are all available at frame n. Again, we assume that we can 
make the actual bit rate close to the target rate by proper rate 
control, i.e.. ' 



7^.1 = Ri.n.t- 



(34) 



The additional constraint on the actual bit rate for frame n 
then becomes the constraint on its target rate, i.e., 



ri-1 

- J2 (35) 



If necessar\. we adjust the target rate for each individual frame 

as follows: 



u - 1 



n ' = n - .V," 

if 7V „ < :V'7i\ mr . - 



„•-„_ X"- ] 

Tt — i 

if Ti.„ > .V"/?^ - 
k 7, „. otherwise. 



'=..-.v; 



(36) 
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3.00 
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3 00 



4.)f> 



2.S4 



TABLE IV 

PSNR (dBj for Both Independlm a.v» J«.:sr Cows::- 



Coding 




Hook 


Ma run 


FU*w cr 


Mobile 


T cnr.is 


Indep. 


32.07 


32.S9 


35.91 


27.52 


25.94 


29.66 


Joint 3 


3^.22 


34.43 


35.43 








Joint 4 


33.56 


33.87 


34.90 


31.43 






Joint 6 


32.76 


33.16 


34.27 


29.94 


2S.70 


30.00 



u;pcr GOP => super trame => r'ramr 



program / j 



-fj Encooer j- 



M-4 



_K,Q : 



procram c 



Buffer 



Channel 



rig. 10. Block diagram of joint coding with proposed joint rate. 



allocation strategy described in Section II. The bits generated 
by the MPEG encoders are multiplexed and moved into the 
encoder buffer for transmission. 

To achieve the target hii raic for each regular frame, rate 
control has to be implemented. TM5 [2] describes a rate- 
control scheme thai adjusts the quantization parameter at 
each maeroblock according to the buffer fullness. Since the 
quantization parameter varies from tnacroblock to maeroblock, 
nonuniform picture quality may result. For example, two 
identical macroblocks in a frame may be assigned two different 
quantization parameters becau.se of the different buffer fullness 
at different times. An alternative is to use a global quantization 
parameter lor the entire frame, li has been shown [9] that the 
use of a unique quantization parameter over a frame results 
in a more uniform picture quality as compared to TM5. The 
global quantization parameter can be adjusted on a framc-by- 
irame basis so that the output bit rate for a frame can be close 
io the target bit rate for the frame. In our simulations, we 
used a binary tree search algorithm to determine the slobal 



quantization parameter for a frame [9]. To exploit spatial 
masking, the global quantization parameter can be further 
modulated for each maeroblock by the respective local spatial 
activity (as in T\15 |2J). It has been shown. [9] that the 
global rate control provides much better and more accurate rate 
control than TM5. The simulation results in [9] demonstrated 
that the deviation of the actual rate from the target rate by- 
using the alternative is about 70-809*- less than with TM5. 

B, Results 

Table III shows the bit rales for the test sequences for both 
independent and joint coding. In independent coding, all the 
sequences were coded at the same rate of 3 Mbits/s. However, 
in joint coding, the sequences were coded with quite different 
numbers of bits, depending on program contents. For example, 
with six programs, jointly coded, sequence Mobile used more 
than twice as many bits as sequence Martin. 

Table IV shows the corresponding PSNR for each sequence. 
Compared with independent coding, joint coding produces a 
much smaller difference in PSNR between programs, even 
with a small number of programs. The variations in PSNR 
between programs depend on program contents. 

Joint coding also produces a more uniform picture quality 
within programs. Table V shows the quality variation (.variance 
of the PSNR) for the test sequences. It should be noted thai 
even with a small number of programs, as few as three, joint 
coding significantly reduces time variations of picture qualitv. 
especially for programs that exhibit large variations in program 
content, such as Ayers, Hook. Martin, and Tenuis. Simulations 
with more (eight and ten) programs have shown similar results, 
with only marginal improvements. 

Figs. 11-13 shou the comparisons in PSNR versus frame 
number for independent and joint coding with a different 
number of programs \L ;J. •!.(»;. It is clearly demonstrated 
that the picture qualii\ tends to be much closer with joint 
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TABLE V 

Variance in PSN"R for Bom Imjephsdevt and Joint Cooing 



Coding 


.Aver j. 


Hook 


Mariir. 


Flower 




Tennis 


1 r.cicp. 


3.59 


6.13 


3.5* 


0.74 


0.23 


10.36 


Jon i 3 


1.13 


1.57 


1 0" 








Joint 4- 


0.82 


1.31 


o.ss 


o.so 






Joint 6 


0.84 


1.53 


0.71 


0.79 


0.66 


4.90 
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Fig. 11. PSNR idB) for three sequences (Ayers. Hook, and A/or/in). (a) F'? 12- PSNR (dB) for four sequences t.4ycj. f tower. Hook, and Afur/m). 
Independent coding, each at 5 Mbits/s. (b) Joint coding at a total rate of Independent coding, each at 3 Mbits/s. ib; Joint coding at a total rate of 12 



9 MbKs/s. 



Mbiis/s. 



coding than with independent coding. Also notice the smaller 
variability in PSNR within each program by joint coding as 
compared to independent coding. 

IV. Conclusions 

In this paper, we examined the bit-allocation issue for joint 
coding of multiple video programs and formulated a bit- 
allocation strategy of a hierarchical nature. At the very top 
level, we introduced the concept of a super GOP so that the 
programs can be reorganized into a set of identical groups 



of super GOPs in terms of the number of frames of each 
picture type. The identical super GOP's are assigned the same 
number of bits. We further defined a super frame as a collection 
of frames, each from one of the programs taken at the same 
time instant. We formulated the bit-allocation strategy for the 
super frames within a super GOP. given a target number of 
bus for the super GOP. We then formulated the. bit-allocation 
strategy for regular frames in each super frame for a given 
target bit rate for the super frame. To prevent the encoder and 
decoder buffers from overflowing and underflowing and to 
limit each individual bit rate within a specific range, we added 
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Pig. I ?. PSNR (dB > for si* sequences (.4 wrj, f lower. Hook. Martin. Mobile 
and Tennis), (a) Independent coding, each at 3 Mbits/s. (b) Joint coding at a 
total rate of 18 Mbiis/s. 



constraints on the target bit rates for both super frame and 
regular frame. To meet the target bit rate for each frame, we 
used the rate-control scheme described in [9]. The simulation 
results demonstrate that joint coding, using the proposed bit- 
allocation strategy, results in more uniform picture quality 
among programs as well as within a program, compared to 
independent coding. In other words, joint coding is able to 
improve channel utilization by dynamically distributing the 
channel capacity among programs according to their respective 
complexities. Furthermore, the benefits of joint coding are 
achieved even with a small number of programs. 
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